RESERVOIR COMPUTING WITH TRUNCATED NORMAL DISTRIBUTION FOR SPEECH EMOTION RECOGNITION
نویسندگان
چکیده
Speech is an effective, quick, and important way for communicating exchanging complex information between humans. Emotions have always been a part of normal human conversation which makes the speech more attractive. Because this major role both emotion, many researchers are inspired by studying Emotion Recognition (SER) still has plenty challenges. In study, we proposed novel reservoir computing approach with initialization random connection weights input weight truncated distribution. Furthermore, Population-Based Training (PBT) adopted to optimize hyperparameters whole Echo State Network (ESN) model significant impact on performance. The bidirectional increase memorization capability, Sparse Random Projection (SRP) was applied dimensional reduction as simple, unsupervised, low complexity approach. speaker-independent strategy employed EMODB SAVEE datasets acted emotion dataset Aibo non-acted dataset. achieved 84.8%, 65.95%, 45.99% unweighted average recalls EMODB, SAVEE, respectively. results show that outperforms recent state-of-the-art studies cheaper computational cost.
منابع مشابه
Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space
In this work, an emotion-pair based framework is proposed for speech emotion recognition, which constructs more discriminative feature subspaces for every two different emotions (emotion-pair) to generate more precise emotion bi-classification results. Furthermore, it is found that in the dimensional emotion space, the distances between some of the archetypal emotions are closer than the others...
متن کاملNeuro-Inspired Speech Recognition Based on Reservoir Computing
This chapter investigates the potential of recurrent spiking neurons for classification problems. It presents a hybrid approach based on the paradigm of Reservoir Computing. The practical applications based on recurrent spiking neurons are limited due to the lack of learning algorithms. Most of the previous work in the literature has focused on feed forward networks because computation in these...
متن کاملFuzzy Truncated Normal Distribution with Applications
In this paper, fuzzy truncated normal distribution is shown. Also, some examples and applications are presented.
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Malaysian Journal of Computer Science
سال: 2022
ISSN: ['0127-9084']
DOI: https://doi.org/10.22452/mjcs.vol35no2.3